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Abstract. For a polynomial / : { — 1, l} n —> C, we define the partition function as 
the average of over all points x 6 { — 1, l} n , where A 6 C is a parameter. We 

present a quasi-polynomial algorithm, which, given such /, A and e > 0 approximates 
the partition function within a relative error of e in N °^ nn ~ lne ) time provided 
|A| < (2Lv / deg"7) — 1 , where L = L(f) is a parameter bounding the Lipschitz constant 
of / from above and N is the number of monomials in /. As a corollary, we obtain 
a quasi-polynomial algorithm, which, given such an / with coefficients ±1 and such 
that every variable enters not more than 4 monomials, approximates the maximum 
of / on { — 1, l} n within a factor of O (U^Ydeg/), provided the maximum is NS for 
some 0 < 5 < 1. If every variable enters not more than k monomials for some fixed 
k > 4, we are able to establish a similar result when 5 > (k — 1 )/k. 


1. Introduction and main results 

(1.1) Polynomials and partition functions. Let { — 1, l} n be the n-dimensional 
Boolean cube, that is, the set of all 2 n n-vectors x — (±1,... , ±1) and let / : 
{ — 1, l) n —» C be a polynomial with complex coefficients. We assume that / is 
defined as a linear combination of square-free monomials: 


( 1 . 1 . 1 ) 


/60 


y tt/x 7 

tc{l,... ,n} 

and x 7 


where 

= IU 

iei 


OLi G C for all I 
for x — (xi,... , x n ), 


where we agree that x® = 1. As is known, the monomials x 7 for I C (1,... ,n} 
constitute a basis of the vector space of functions / : {—1, l} n —> C. 
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We introduce two parameters measuring the complexity of the polynomial / in 
(1.1.1). The degree of / is the largest degree of a monomial x 7 appearing in (1.1.1) 
with a non-zero coefficient, that is, the maximum cardinality |/| such that aj ^ 0: 


deg f = max 1/1. 

I: a^O 

We also introduce a parameter which controls the Lipschitz constant of /: 


L(f) = max 


\ a l\- 


/C{1,... ,n} 

iEl 


Indeed, if dist is the metric on the cube, 

n 

dist (a;, y) = y \xj - y t \ where x = {x lt ,.. , x n ) and y = (j/i,,.. , y n ) 

i=l 


then 

\f(x)~f{y)\ < T(/)dist(x,y). 

We consider { —1, l} n as a hnite probability space endowed with the uniform mea¬ 
sure. 

For A G C and a polynomial / : { —1, l} n —> C, we introduce the partition 
function 

— Y e A/w -Ee A/ . 

2 n ^ 

*€{ —1,1} TI 

Our hrst main result bounds from below the distance from the zeros of the 
partition function to the origin. 

->■ C be a polynomial and let A e C be such 


(1.2) Theorem. Let f : ( — 1, l} n - 

that 

I'M < 

Then 


0.55 


L(f)Vdegf' 


Ee xf ± 0. 

If, additionally, the constant term of f is 0 then 

|Ee A/ | > (0.41) 7 


We prove Theorem 1.2 in Section 4. As a simple example, let / (xi,... ,x n ) — 
X\ + ... + x n . Then 

_ ^ \ n 

e + 2 e j . 
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We have L(f) = deg / = 1 and Theorem 1.2 predicts that Ee^ ^ 0 provided 
|A| < 0.55. Indeed, the smallest in the absolute value root of Ee A ^ is A = 7rz/2 
with |A| = 7r/2 « 1.57. If we pick f(x i,... ,x n ) — ax\ + ... + ax n for some real 
constant a > 0 then the smallest in the absolute value root of Ee A ^ is ni/2a with 
|A| inversely proportional to L(f), just as Theorem 1.2 predicts. It is not clear 
at the moment whether the dependence of the bound in Theorem 1.2 on deg / is 
optimal. 

As we will see shortly, Theorem 1.2 implies that E e x $ can be efficiently computed 
if |Aj is strictly smaller than the bound in Theorem 1.2. When computing Ee A ^, 
we may assume that the constant term of / is 0, since 

Ee A(/+ “) = e Aa Ee A/ 


and hence adding a constant to / results in multiplying the partition function by 
a constant. 

For a given /, we consider a univariate function 

A > Ee A/ . 

As follows from Theorem 1.2, we can choose a branch of 

9 (AHMEe-) for |A| < 

such that g(0 ) = 0. It follows that g( A) is well-approximated by a low degree Taylor 
polynomial at 0. 

(1.3) Theorem. Let f : (—1, l} n — * C be a polynomial with zero constant term 
and let 

9 (A) = M EeV) /or |A, < 

For a positive integer m < 5 n, let 

m \k jk 


T m (f-,X) = Z 


k=1 


X k d A 
Tidx* 


a(a) 


A=0 


be the degree m Taylor polynomial of g(X) computed at X — 0. Then for n > 2 

5r)n 

provided 

(1 ' 3,1> |A| £ mJh/WT 


In Section 3, we deduce Theorem 1.3 from Theorem 1.2. 

As we discuss in Section 3.1, for a polynomial / given by (1.1.1), the value of 
T m (/; A) can be computed in nN°( rn '> time, where N is the number of monomials in 
the representation (1.1.1). Theorem 1.3 implies that as long ase > e~ n , by choosing 
m = 0(lnn — lne), we can compute the value of Ee A ^ within relative error e in 
jyO(inn-ine) (-j ine provided A satisfies the inequality (1.3.1). For e exponentially 
small in n, it is more efficient to evaluate E e x f directly from the definition. 
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(1.4) Relation to prior work. This paper is a continuation of a series of papers 
by the author [Bal5], [Bal6] and by the author and P. Soberon [BS14], [BS16] on 
algorithms to compute partition functions in combinatorics, see also [Rel5]. The 
main idea of the method is that the logarithm of the partition function is well- 
approximated by a low-degree Taylor polynomial at the temperatures above the 
phase transition (the role of the temperature is played by 1/A), while the phase 
transition is governed by the complex zeros of the partition function, cf. [YL52], 
[LY52], 

The main work of the method consists of bounding the complex roots of the par¬ 
tition function, as in Theorem 1.2. While the general approach of this paper looks 
similar to the approach of [Bal5], [Bal6], [BS14] and [BS16] (a martingale type and 
a fixed point type arguments), in each case bounding complex roots requires some 
effort and new ideas. Once the roots are bounded, it is relatively straightforward 
to approximate the partition function as in Theorem 1.3. 

Another approach to computing partition functions, also rooted in statistical 
physics, is the correlation decay approach, see [We06] and [BG08]. While we did 
not pursue that approach, in our situation it could conceivably work as follows: 
given a polynomial / : { —1, l} n —> M and a real A > 0, we consider the Boolean 
cube as a finite probability space, where the probability of a point x G {-1,1}" 
is e A A x )/Ee A A This makes the coordinates x±,... ,x n random variables. We 
consider a graph with vertices x\,... ,x n and edges connecting two vertices ay and 
Xj if there is a monomial of / containing both ay and ay . This introduces a graph 
metric on the variables ay,... ,x n and one could hope that if A is sufficiently small, 
we have correlation decay: the random variable ay is almost independent on the 
random variables sufficiently distant from ay in the graph metric. This would allow 
us to efficiently approximate the probabilities P (ay = 1) and P (ay = —1) and then 
recursively estimate E e A l. 

While both approaches treat the phase transition as a natural threshold for 
computability, the concepts of phase transition in our method (complex zeros of 
the partition function) and in the correlation decay approach (non-uniqueness of 
Gibbs measures) though definitely related and even equivalent for some spin systems 
[DS87], in general are different. 

Theorem 1.3 together with the algorithm of Section 3.1 below implies that to 
approximate E e x f within a relative error of e > 0, it suffices to compute moments 
E f k for k — 0(lne -1 ). This suggests some similarity with one of the results 
of [K+96], where (among other results) it is shown that the number of satisfying 
assignments of a DNF on n Boolean variables is uniquely determined by the numbers 
of satisfying assignments for all possible conjunctions of k < 1 + log 2 n clauses of 
the DNF (though this is a purely existential result with no algorithm attached). 
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Each conjunction of the DNF can be represented as a polynomial 


4>j{ x ) 


2l s h n (1 + CiXi) where 

i€Sj 

Sj C {1,... ,n} and G { — 1,1}, 


and we let 

m 

f ( x ) = ^2M x )- 

3 = 1 

Then the number of points x G {—1,1}" such that f(x) > 0 is uniquely determined 
by various expectations E (f)j 1 ■ ■ ■ (j)j k for k < 1 + log 2 n. The probability that f(x ) = 
0 for a random point x G {— 1. 1} n sampled from the uniform distribution, can 
be approximated by Ee _A ^ for a sufficiently large A > 0. The expectations are 
precisely those that arise when we compute the moments E f k . It is not clear 
at the moment whether the results of this paper can produce an efficient way to 
compute the number of satisfying assignments. 

2. Applications to optimization 

(2.1) Maximizing a polynomial on the Boolean cube. Let / : (-1, 1}- —► 
1 be a polynomial with real coefficients defined by its monomial expansion (1.1.1). 
As is known, various computationally hard problems of discrete optimization, such 
as finding the maximum cardinality of an independent set in a graph, finding the 
minimum cardinality of a vertex cover in a hypergraph and the maximum constraint 
satisfaction problem can be reduced to finding the maximum of / on the Boolean 
cube (—1, l} n , see, for example, [BH02], 

The problem is straightforward if deg / < 1. If deg f — 2, it may already be quite 
hard even to solve approximately: Given an undirected simple graph G — (V. E) 
with set V — (1,... ,n} of vertices and set E C (^) of edges, one can express 
the largest cardinality of an independent set (a set vertices no two of which are 
connected by an edge of the graph), as the maximum of 

1 n 1 

/(^) = 2 + 1 ) _ 4 (! + Xi) (! + Xj) 

i=l {i,j}eE 

on the cube { —1, l} n . It is an NP-hard problem to approximate the size of the 
largest independent set in a given graph on n vertices within a factor of n 1 ~ e for 
any 0 < e < 1, fixed in advance [HaOl], [Zu07]. If deg / = 2 and / does not contain 
linear or constant terms, the problem reduces to the max cut problem in a weighted 
graph (with both positive and negative weights allowed on the edges), where there 
exists a polynomial time algorithm achieving an 0(ln n) approximation factor, see 
[KN12] for a survey. 
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If deg f > 3, no efficient algorithm appears to be known that would outperform 
choosing a random point x E {— 1, l} n . The maximum of a polynomial / with 
deg / = 3 and no constant, linear or quadratic terms can be approximated within 
an O ( \fnj hi nf factor in polynomial time, see [KN12], Finding the maximum of 
a general real polynomial (1.1.1) on the Boolean cube { —1, l} n is equivalent to 
the problem of finding the maximum weight of a subset of a system of weighted 
linear equations over Z 2 that can be simultaneously satisfied [HV04], Assuming 
that deg / is fixed in advance, / contains N monomials and the constant term of / 
is 0, a polynomial time algorithm approximating the maximum of / within a factor 
of O(VN) is constructed in [HV04], More precisely, the algorithm from [HV04] 
constructs a point x such that f(x) is within a factor of 0(\/N) from \ a A f° r 

/ defined by (1.1.1). If deg / > 3, it is unlikely that a polynomial time algorithm 
exists approximating the maximum of / within a factor of 2 (ln N] for any fixed 
0 < e < 1 [HV04], see also [HaOl]. 

Let us choose ^ 

2L(/) v / deg/ 

as in Theorem 1.3. As is discussed in Section 3.5, by successive conditioning, we 
can compute in jV°( lnn_lne ) time a point y E {—1, l} n which satisfies 

(2.1.1) e A/(y) > (l-e)Ee A/ 

for any given 0 < e < 1. 

How well a point y satisfying (2.1.1) approximates the maximum value of / on 
the Boolean cube { — 1, l} n ? We consider polynomials with coefficients —1,0 and 
1, where the problem of finding an x E { — 1, l} n maximizing f(x) is equivalent to 
finding a vector in ITf satisfying the largest number of linear equations from a given 
list of linear equations over Z 2 . 

(2.2) Theorem. Let 

f ( x ) = ai * 1 

ieT 

be a polynomial with zero constant term, where T is a family of non-empty subsets 
of the set {1,... , n} and aj — ±1 for all I E T. Let 

max f(x) — dlJ 7 ] for some 0 < <5 < 1. 
xef-1,1} 71 

Suppose further that every variable aq enters at most four monomials x 7 for I E T. 
Then 

f q \2s:2 'i 

Ee^ > exp < ^ \iF\ > for 0 < A < 1. 

Since E / = 0, the maximum of / is positive unless J- = 0 and / = 0. It is not- 
clear whether the restriction on the number of occurrences of variables in Theorem 
2.2 is essential or an artifact of the proof. We can get a similar estimate for any 
number occurrences provided the maximum of / is sufficiently close to \ J-\. 
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(2.3) Theorem. Let 


f(x ) = «/x 7 

ieT 

be a polynomial with zero constant term, where 7 is a family of non-empty subsets 
of the set {1,... ,n} and aj = ±1 for all I 6 J. Let k > 2 be an integer and 
suppose that every variable Xi enters at most k monomials x 7 for I G 7. If 


then 


max f(x) > 
cce{—i,i}^ 


k - 1 
k 


7 


Ee^ > exp 



for all 0 < A < 1. 


We prove Theorems 2.2 and 2.3 in Section 5. 

Let / be a polynomial of Theorem 2.2 and suppose that, additionally, |/| < d 
for all I G 7, so that deg / < d. We have L(f) < 4 and we choose 


8 VS' 

Let y G ( — 1, l} n be a point satisfying (2.1.1). Then 


f(y) > 


1 

A 


In E e x f + 


ln( 1 — e) 
A 


> 


3AA 2 ln(l — e) 
16 1 1 + A 


That is, if the maximum of / is at least <5| 7\ for some 0 < 5 < 1, we can approximate 
the maximum in quasi-polynomial time within a factor of O 1 ^cl'j. Equivalently, 

if for some 0 < 5 < 0.5 there is a vector in Z 2 satisfying at least (0.5 + 5)|A r | 
equations of a set 7 of linear equations over Z 2 , where each variable enters at most 
4 equations, in quasi-polynomial time we can compute a vector v G Z 2 satisfying 
at least (0.5 + <5i)|J r | linear equations from the system, where hi = Q(h 2 / Vd) and 
d is the largest number of variables per equation. 

Similarly, we can approximate in quasi-polynomial time the maximum of / in 
Theorem 2.3 within a factor of 0(k\fd) provided the maximum is sufficiently close 
to lA 7 !, that is, is at least ■^-|A r |. 

In Theorems 2.2 and 2.3, one can check in polynomial time whether the maximum 
of / is equal to IT), as this reduces to testing the feasibility of a system of linear 
equations over Z 2 . However, for any fixed 0 < S < 1, testing whether the maximum 
is at least h|T’| is computationally hard, cf. [HaOl]. 

Hastad [HaOO] constructed a polynomial time algorithm that approximates the 
maximum of / within a factor of O(kd). In [B+15], see also [Hal5], a polynomial 
algorithm is constructed that finds the maximum of / within a factor of e°^Vk, 
provided / is an odd function. More precisely, the algorithm finds a point x such 
that f{x) is within a factor of e°^Vk from |T"|. 
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3. Computing the partition function 


(3.1) Computing the Taylor polynomial of g(X) = In (Ee A ^). First, we dis¬ 
cuss how to compute the degree m Taylor polynomial T m (/;A) at A — 0 of the 
function 

9(A) = In (E AO, 

see Theorem 1.3. Let us denote 

h( A) = Ee A ^ and g(X) — lnh(A). 


Then 


Therefore, 


/ h' 

g' = — and hence h! — g'h. 
h 


(3.1.1) M fc) (0) = ^ ^ (i) (0)h (fc - j) (0) for k = 1,.. 

If we calculate the derivatives 

(3.1.2) h( 0), A< 1 >(0),...,fc<’">(0), 


then we can compute 

9(0), 9< 1 >(0),...,g<’">(0) 

by solving a non-singular triangular system of linear equations (3.1.1) which has 
h( 0) = 1 on the diagonal. Hence our goal is to calculate the derivatives (3.1.2). 

We observe that 

ft< ‘ ) (°) = ^ E /‘w = e/‘. 

*€{ — 1 , 1 }" 

For a polynomial / defined by its monomial expansion (1.1.1) we have 


E/ = ex®. 

We can consecutively compute the monomial expansion of /, / 2 ,... , / m by using 
the following multiplication rule for monomials on the Boolean cube (—1, l} n : 

x 7 x J = x /AJ , 


where JAJ is the symmetric difference of subsets /, J C { 1 ,... , n}. It follows then 
that for a polynomial / : ( — 1, l} n —> C given by its monomial expansion (1.1.1) 
and a positive integer m, the Taylor polynomial 


T m (/;A) 


E 

fc=i 


X k d k 
~k\dX k 




A=0 


can be computed in nN °( m ) time, where A r is the number of monomials in /. 

Our next goal is deduce Theorem 1.3 from Theorem 1.2. The proof is based on 
the following lemma. 



(3.2) Lemma. Let p : C —» C be a univariate polynomial and suppose that for 
some f3 > 0 we have 

p(z) ^ 0 provided \z\ < f3. 

Let 0 < 7 < f3 and for \z\ <7, let us choose a continuous branch of 

g(z) = In p(z). 


Let 


Tm(z) — g{ 0) + ^ 


k =1 


z k d k 
k\ dz k 


9(z) 


z =0 


be the degree m Taylor polynomial of g(z) computed at z — 0. Then for 

r > 1 
7 


we have 


\g(z)~T m (z) | < 


deg p 


for all \z\ < 7. 


(m + 1 )r m (r — 1) 

Proof. Let n — deg p and let ai,... , a n be the roots of p, so we may write 


i=l 


il -') 

where | 

V «./ 



Then 

g{z) = 7(0) + I 11 f 1 “ -7) > 

1 \ J 

i—l / 

where we choose the branch of the logarithm which is 0 when z = 0. Using the 
Taylor series expansion of the logarithm, we obtain 



provided 


A < 7, 


where 


I Cm 


+ OO 

- £ 

k = m -\-1 



< 


+00 

£ 


k=m -\-1 



< 


1 

(m + 1 )r m (r — 1) 


Therefore, 


g(z)=g( 0) - 

i=1 fc=1 


ka k 

9 


T 7m 


for 


A < 7, 



where 


n 


\Vm 


< 

(to + 1 )r m (r — 1) 


It remains to notice that 


T m (z)-g( 0) 


□ 

Next, we need a technical bound on the approximation of e z by its Taylor poly¬ 
nomial. 

(3.3) Lemma. Let p > 0 be a real number and let m > 5 p be an integer. Then 

< e~ 2p for all z G C such that \z\ < p. 


m L 

_ /y 

Y- 


k =0 


Proof. For all z G C such that \z\ < p, we have 


m u 

E Z K 

~V\ 


k =0 


+°° y k 

Y — 

^ k\ 


k=m-\-l 
rn 


< 


+°° ,.k 

y — 

k< 

k=m -\-1 


p 


i+l +°° n k 


p k (m + 1)! 
(to + 1)! (k + to + 1)! 


E 


< 


p"-+1 pk pm+l e p pm+lgP+m+1 


(to + 1)! ^ k\ 


(to + 1)! “ (to T l) m + 1 


Since to > 5p, we obtain 


+°° v k 

e “-E« 


fc=o 


< 


pTn+lgP+ra+l g p 

5 m +V m +i ~~ (5/e) m+1 “ (5 /e) 5 P 


< 


< e~ 2p . 


and the proof follows. □ 

(3.4) Proof of Theorem 1.3. Without loss of generality, we assume that L(f ) = 
1. Since the constant term of / is 0, for any x G {—1,1}", we have 


\f{x)\ < m - n - 

i= 1 I: iEl 


Applying Lemma 3.3, we conclude that 


e 


yA ( A /(*))* 

^ k\ 


k =0 


< e~ 2n 
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(3.4.1) 


A/(sc) 


for all x G {—1, l} n 



provided |A| < 1. Let 


PW = ! + 


^A fc d k 


k\ cl\ k 


(Ee xf ) 


be the degree 5 n Taylor polynomial of the function A i—> E e x f at A 

(3.4.1) it follows that 

|Ee A/ -p(A)| < e~ 2n provided |A| < 1. 

From Theorem 1.2, we conclude that 


= 0. From 


p{ A) ^ 0 for all A G C such that |A| < 


/ deg f 


and, moreover, 


(3.4.2) llnp(A) — In (Ee^) I < e n provided |A| < 33 and n > 2 . 

v deg / 

Applying Lemma 3.2 with 

„ 0.55 0.5 J [3 1 , 

P = n - T -. 7 = n - j and r = - = 1 . 1 , 

V deg / v deg / 7 

we conclude that for the Taylor polynomial of lnp(A) at A = 0, 

171 ^ ^ jfc 

T m (A) = ln P ( 0 ) + J] ¥ 5 ^l„ P (A)| > _ o 


we have 

50r? 1 

(3.4.3) |T m (A) - lnp(A)| < (m+1)(11)m Prided |A| < ^=. 

It remains to notice that the Taylor polynomials of degree m < 5 n of the functions 


A i—> In (E e A ^) and A i—> lnp(A) 


at A = 0 coincide, since both are determined by the first m derivatives of respectively 
Ee A ^ and p{ A) at A = 0, cf. Section 3.1, and those derivatives coincide. The proof 
now follows by (3.4.2) - (3.4.3). □ 
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(3.5) Computing a point y in the cube with a large value of f(y). We 
discuss how to compute a point y G {—1, l} n satisfying (2.1). We do it by successive 
conditioning and determine one coordinate of y = (jq,... ,y n ) at a time. Let F + 
and F~ be the facets of the cube ( — 1, l} n defined by the equations x n — 1 and 
x n = —1 respectively for x — (aq,... ,x n ), x G {—1, l} n . Then F + and F~ can be 
identified with the (n — 1)-dimensional cube ( — 1, l } n “ 1 and we have 

Ee A/ = -E (e A/ |F+) + -E (e A/ |F“) . 

2 2 

Moreover, for the restrictions f + and f~ of / onto F + and F~ respectively, con¬ 
sidered as polynomials on { — 1 , l} n_1 , we have 

deg/+, deg/ - < deg/ and L(f + ), L(f~) < L(f). 

Using the algorithm of Section 3.1 and Theorem 1.3, we compute E (e A ^|E + ) and 
E (e x ^\F~) within a relative error e/2n, choose the facet with the larger computed 
value, let y n — 1 if the value of E (e A ^ | F + ) appears to be larger and let y n = — 1 if 
the value of E (e A ^|E _ ) appears to be larger and proceed further by conditioning 
on the value of y n - 1 - For polynomials with N monomials, the complexity of the 
algorithm is lY°( lnn ). 


4. Proof of Theorem 1.2 


To prove Theorem 1.2, we consider restrictions of the partition function onto 
faces of the cube. 

(4.1) Faces. A face F C (—1, l} n consists of the points x where some of the 
coordinates of x are fixed at 1 , some are fixed at —1 and others are allowed to 
vary (a face is always non-empty). With a face F. we associate three subsets 
J+(F),I_(F), J(F) C {1,... ,n} as follows: 

/-I -(F) ={i : Xi — 1 for all x E F, x — (aq, ... , x n )}, 

I-(F) ={i : Xi — — 1 for all x € F, x = (aq,... ,x n )} and 
1(F) ={1, ... ,n} \ (I + (F) U I-(F)). 


Consequently, 


F 


,... , x n ) where 


Xi — 1 for i G I+(F) and 
Xi = —1 for i G J_(F)|. 


In particular, if u (F) = I-(F) = 0 and hence 1(F) = {1,... ,n}, we have 
F = ( — 1, l} n . We call the number 


dim F - |J(F)| 
12 



the dimension of F. 

For a subset J G {1,. .. , n}, we denote by { — 1,1 } J the set of all points 
x — (xj : j G .7) where Xj = ±1. 

Let F C { — 1, l} n be a face. For a subset J C I{F) and a point e G { — 1, 1} J , 
e = (ej : j G J), we define 

F £ = {i G F, x = (xi,... , x n ) : Xj = ej for j G J}. 

In words: F e is obtained from F by fixing the coordinates from some set J C 1(F) 
of free coordinates to 1 or to —1. Hence F £ is also a face of { — 1, l} n and we think 
of F e C F as a face of F. We can represent Fas a disjoint union 

(4.1.1) F= (J F e for any J C 1(F). 

(4.2) The space of polynomials. Let us fix a positive integer d. We identify 
the set of all polynomials / as in ( 1 . 1 . 1 ) such that deg / < d and the constant term 
of / is 0 with C N , where 


N = N(n, d) 


d 


£ 



For 5 > 0, we consider a closed convex set U(5) C consisting of the polynomials 
/ : (—1, l} n —>■ C such that deg f < d and L(f) < 6. In other words, U(5) consists 
of the polynomials 

/(*) = ayx 7 where \ai\ < 5 for i = 1, ... ,n. 

7C{1,... ,n} 7: *67 

l<|7|<d 

(4.3) Restriction of the partition function onto a face. Let / : (—1, l} n —» 
C be a polynomial and let F C { —1, l} n be a face. We define 

E £lL = ^£<= /w - 

xEF 


We suppose that / is defined by its monomial expansion as in (1.1.1) and consider 
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E (ef\F) as a function of the coefficients ay. Using (4.1.1) we deduce 

—E (e f \F) = —i-p V x J e /(a:) 
daj v 1 ’ 2 dimF ^ 

J xef 

2 hW)l 

x y ( n ) y ef{x) 

(4.3.1) ee{-i,i} /(F)nJ \j€i(F)nJ ) xeF- 

e=fe: jei(F)nJ) 

(_l)h-(^)nJ| 

= 2 l J ( F ) nJ l 

* E ( II <UE(e'|f"). 

e6 { —l,l} 7 ( F ) nJ \jeI(F)nJ ) 

£ =(e,-: jel(F)nJ) 


In what follows, we identify complex numbers with vectors in M 2 = C and 
measure angles between non-zero complex numbers. 

(4.4) Lemma. Let 0 < r < 1 and 5 > 0 be real numbers and let F C { — 1, l} n be 
a face. Suppose that for every f Eld (5) we have E (e^| F) ^ 0 and, moreover, for 
any K C I{F) we have 

|eUie)| > iffj K ' E l E U.ffil- 

ee{-l,l} K 

Given f Eli (5) and a subset J C {1,... ,n} such that \J\ < d, let f Eld (6) be the 
polynomial obtained from f by changing the coefficient aj of the monomial x J in 
f to —aj and leaving all other coefficients intact. Then the angle between the two 
non-zero complex numbers E (e^| F) and E (e^\F^J does not exceed 

2|o:j| 


Proof. Without loss of generality, we assume that aj ^ 0. 

We note that for any / G ld(S), we have / G U{$). Since E (e^|F) ^ 0 for all 
/ G Id (5), we may consider a branch of InE (e^|F) for / G U{5). 

Let us fix coefficients a/ for I ^ J in 

f( x ) = y a/x 7 

7C{1,... ,n} 
l<\I\<d 

14 


(4.4.1) 



and define a univariate function 


g(a ) = InE (e-'| F) where |ct| < \aj\ 


obtained by replacing aj with a in (4.4.1). 
We obtain 


(4.4.2) 

Let 


d 


d 


9 ' {a) = d^ hlE ^ /|F) = I ^ /|F) ) / E ( e/|F) ' 


da., 


k = |/(F)n J\ < \J\ < d. 
Using (4.3.1) we conclude that 


(4.4.3) 


d 


daj 

On the other hand, 


E (e f \F) 


1 V 

— 2k / -/ 

ee{-l,l} J ( F ) nJ 


E (e^|F«)| 


(4.4.4) 


E (e / |F)| > 


E l E UiOI 

ee{-l,l} 7 ( F ) nJ 


Comparing (4.4.2) - (4.4.4), we conclude that 

d 


b'(«) | = 


da. 


InE (e / |F) 


1 1 

rj-k sj-d 


Then 


InE (e / |F) - InE (e*\F\ = \g {aj) - g (-aj)| < 2\aj\ max \g\a)\ < 


2\aj\ 


a .\< la j 




and the proof follows. □ 

(4.5) Lemma. Let 6 > 0 and 5 > 0 be real numbers such that 65 < n, let F C 
{ — 1, l} n be a face such that dim F < n and suppose that E (e^|F) ^ 0 for all 
f G 14(5). Assume that for any f G 14(5), for any J C (1,... ,n} such that \ J\ < cl, 
and for the polynomial f obtained from f by changing the coefficient aj to —aj 
and leaving all other coefficients intact, the angle between non-zero complex numbers 
E (e f \F) and E does not exceed 9\aj\. 

Suppose that F C {—1, l} n is a face obtained from F by changing the sign of one 
of the coordinates in I+(F) U I~(F). Then G = F UF is a face of { — 1, l} n and for 


95 


T = COS 
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we have 


|E(e'|G)| > | (|E (e / | J F)| + |E (e-^li?)!) 

for any f eU(5). 

Proof. Suppose that F is obtained from F by changing the sign of the i-i h coor¬ 
dinate. Let / be a polynomial obtained from / by replacing the coefficients Qj by 
—ay whenever i e / and leaving all other coefficients intact. Then / G U(5) and 

the angle between E (e^|F) and E F^J does not exceed 

6 m - 65 - 

I: iel 

On the other hand, E (e^\F^\ — E fe^\ f') and 


E (e'|G) = (e>\F) + jE (G|f) = ^E (e>\F) + jE (G>) . 

Thus E ( e f\G) is the sum of two non-zero complex numbers, the angle between 
which does not exceed 65 < n. Interpreting the complex numbers as vectors in 
M 2 = C, we conclude that the length of the sum is at least as large as the length of 
the sum of the orthogonal projections of the vectors onto the bisector of the angle 
between them, and the proof follows. □ 

(4.6) Proof of Theorem 1.2. Let us denote d = deg/. 

One can observe that the equation 



has a solution 6 > 0 for all sufficiently small (3 > 0. Numerical computations show 
that one can choose 

P = 0.55, 


in which case 


9 « 2.748136091. 


Let 

We observe that 

Let 


p _ 0.55 
y/d y/d 


0 < 65 < 6P « 1.511474850 < 7r. 


65 6P 

T — COS = COS -=. 

2 2 y/d 
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In particular, 


r > cos 


0.7277659962. 


Next, we will use the inequality 


(4.6.1) 


(X \ 7T 

cos —== > cos a for 0 < a < — and d > 1. 

Vd) ~ ~ ~ 2 


One can obtain (4.6.1) as follows. Since tan(0) = 0 and the function tanct is convex 
for 0 < a < 7 t/2 , we have 


Vd tan —= < tana for 0 < a < —. 


Integrating, we obtain 


(x 7T 

d In cos —= > In cos a for 0 < a < — 

Vd ~ 2 


and (4.6.1) follows. 

Using (4.6.1), we obtain 

(4-6.2) , 2 „„ 


We prove by induction on to = 0,1,... ,n the following three statements. 

(4.6.3) Let F C { — 1, l} n be a face of dimension m. Then, for any / G U(5), we 
have E (e^| F) ^ 0. 

(4.6.4) Let F C {—1, l} n be a face of dimension to, let / E U(S) and let / be 
a polynomial obtained from / by changing one of the coefficients aj to -aj and 
leaving all other coefficients intact. Then the angle between two non-zero complex 

numbers E (e^| F) and E (e^\ f\ does not exceed 0|aj|. 


(4.6.5) Let F C { —1, l} n be a face of dimension to and let / G U{5). Assuming 
that to > 0 and hence 1(F) ^ 0, let us choose any i G 1(F) and let F + and F~ be 
the corresponding faces of F obtained by fixing x t = 1 and x r — —1 respectively. 
Then 

|E(e / |F)| > - (|E (e / |F+)| + |E (e / |F-)|) . 

2 


If m = 0 then F consists of a single point x G {-l,l} n , so 

E (e / |F) = e f{x) ± 0 

17 



and (4.6.3) holds. Assuming that / is obtained from / by replacing the coefficient 
aj with — aj and leaving all other coefficients intact, we get 


E (e f \F) 
E (ef\Fj 


exp |2ajx J } . 


Since 

|2ajx J | = 2\otj\ < 0\aj\, 

the angle between E (e^| F) and E does not exceed 9\aj\ and (4.6.4) follows. 

The statement (4.6.5) is vacuous for m = 0. 

Suppose that (4.6.3) and (4.6.4) hold for faces of dimension m < n. Lemma 4.5 
implies that if F is a face of dimension m + 1 and F + and F~ are m-dimensional 
faces obtained by fixing Xi for some i G 1(F) to x % — 1 and x % = —1 respectively, 
then 

i / f 17 ~i\ i ( 65\ \E (ef\F+)\ + \E (ef\F~)\ 

|E(eV)l > ( cos y)^- 2 -“ 

= ^(|E(e-'|F+)| + |E (e f \F-)\) 

and the statement (4.6.5) holds for (m + 1)-dimensional faces. 

The statement (4.6.5) for (m + l)-dimensional faces and the statement (4.6.3) 
for m-dimensional faces imply the statement (4.6.3) for (m + l)-dimensional faces. 

Finally, suppose that the statements (4.6.3) and (4.6.5) hold for all faces of 
dimension at most m + 1. Let us pick a face F C {— 1, l} n of dimension m + 1, 
where 0 < m < n. Applying the condition of statement (4.6.5) recursively to the 
faces of F. we get that for any K C 1(F), 

|E( e '|F)| > (L' K| £ |E(e'|F«)|. 

ee{-l,l} JC 

Then, by Lemma 4.4, the angle between two non-zero complex numbers E (e*\F) 
and E (e^l f) does not exceed 


2bj| 


2| OLJ 



< 0\atj\ 


by (4.6.2), and the statement (4.6.4) follows for faces of dimension m + 1. 

This proves that (4.6.3) - (4.6.5) hold for faces F of all dimensions. Iterating 
(4.6.5), we obtain that for any f EU(S), we have 


|Ee / | > 


' E 

*€{- 1 , 1 }" 
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J(x) 



Since for any ie{-l,l} n and for any f EU(S), we have 

n 

1 / 0*01 < \oti\ < n5 < (3n , 

i=i /c {1,,.. ,n} 
iei 

we conclude that 

|Ee / | > r n e _/3n > (0.41) n . 

The proof follows since if / : { —1, l} n —> C is a polynomial with zero constant 
term and 

°' 55 

£(/)Vdeg/’ 

then A/ G W(h). □ 


5. Proofs of Theorems 2.2 and 2.3 


The proofs of Theorems 2.2 and 2.3 are based on the following lemma. 

(5.1) Lemma. Let 

fi x ) = a i yiI 

iet 

be a polynomial such that otj > 0 for all I G IF. Then 


Ee f > J] 

ieT 


e a1 +e~ ai 


Proof. Since 


we have 


e ax = 


e a + e~ a 


+ x 


for x — ±1, 


(5.1.1) Ee / = E Y[e ai ^ = E ]J 

Since 


ieT 


IET 


e a1 + e~ ai 


+ x 


e oti _ g-a/ 


> 0 provided ay > 0 


and 

E (x Jl • • • x /fc ) > 0 for all /i,... , Ip-. 

expanding the product in (5.1.1) and taking the expectation, we get the desired 
inequality. □ 

Next, we prove a similar estimate for functions / that allow some monomials 
with negative coefficients. 
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gnu = Q). 


(5.2) Lemma. Let f(x) = g(x) — h(x) where 

9 ( X ) = Y X ^ H x )=Y xI ’ 

leg ieu 

Suppose that the constant terms of g and h are 0 and that every variable x t enters 
not more than k monomials of f for some integer k > 0. Then 

3A 2 


E e xf > exp | ■ 


(\G\-(k-l)\n\)} for 0 < A < 1. 


Proof. Since E / = 0, by Jensen’s inequality we have 

Ee A/ >1 

and the estimate follows if \Q\ < (k — 1)|"H|. Hence we may assume that \Q\ > 

(k-i)\n\. 

Given a function / : {—1, l} n —> R and a set J C {1,... , n} of indices, we 
define a function (conditional expectation) fj : (—1, —» R obtained by 

averaging over variables X j with j £ J: 

fj (Xi : i £ J) = -j^r f(xi,---,x n ). 


Xj —il 

jeJ 


In particular, fj — f if J — 0 and fj — E/ if J = (1 ,,n}. We obtain the 
monomial expansion of fj by erasing all monomials of / that contain Xj with 
j £ J. By Jensen’s inequality we have 

(5.2.1) Ee^ > Ee A ^ J for all real A. 

Let us choose a set J of indices with \ J\ < \TL\ such that every monomial in h(x) 
contains at least one variable Xj with j G J. Then every variable Xj with j E J is 
contained in at most ( k — 1) monomials of g(x) and hence fj is a sum of at least 
\g\ — {k — 1)\TL\ monomials. 

From (5.2.1) and Lemma 5.1, we obtain 

, p \ , -X X \G\~(k-l)\U\ , X 2\ \Q\-{k-l)\H\ 

Ee x f > Ee x f J > ( 6 ) > 1 + - 


Using that 
(5.2.2) 


2 

X f X \ 

ln(l + x) > x -— = x (1 — — j for x > 0, 

\ £ / 


we conclude that 
A 2 


Ee xf > exp | 


as desired. 


A 2 A 1 f3A 2 

2 v l- T )(|£|-(^-l)l^l)f > exp l — {\g\- \k- l)\H\) 


□ 


Now we are ready to prove Theorem 2.3. 
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(5.3) Proof of Theorem 2.3. Let Xq G (—1, l} n , Xq — (£i,... ,f n ) be a maxi¬ 
mum point of /, so that 

max f(x) = f(x 0 ). 

*€{- 1 * 1 }" 

Let us define / : {—1, l} n —> R by 


f(x 1 : ...,X n ) = f (fiZi, . . . , inX n ) . 

Then 

max fix)— max fix), Ee A ^ = Ee A ^ 

*€{- 1 , 1 }" *€{ — 1 , 1 }" 

and the maximum value of / on the cube (—1, l} n is attained at u — (1,... , 1). 
Hence without loss of generality, we may assume that the maximum value of / on 
the cube {—1, l} n is attained at u — (1,... , 1). 

We write 

f(x) = g(x) — h(x) where g(x ) = x 7 and h(x) — x J 

leg ieu 

for some disjoint sets Q and % of indices. Moreover, 

max f{x) = f(u ) = \Q\ - \n\ > - \fF\. 

*€{-i,i} n k 

Since 

\g\ + \H\ = \n 

we conclude that 

ob — 1 1 

isi > ~2t\?\ and m < 

By Lemma 5.2, 

Ee y > exp|5L(|a|-(fc-l)|W|)} > exp|?L|Jf|| 


as desired. □ 

To prove Theorem 2.2, we need to handle negative terms with more care. 

(5.4) Lemma. Let fix ) = g(x) — h(x) where 

g{x) = '^^x. 1 , hix) — ^^^, GnlL — ^ 
leg ieu 


and 


101 > 1 ^ 1 - 
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Suppose that the constant terms of g and h are 0 and that the supports JeH of 
monomials in h(x) are pairwise disjoint. Then 


Ee A/ > expj^ (VW\~ VW\) 2 } for 0 < A < 1. 


Proof. By Jensen’s inequality we have 

Ee xf > exp {AE /} = 1, 

which proves the lemma in the case when \Q\ — \TL\- Hence we may assume that 

\g\>\n\. 

If \TL\ = 0 then, applying Lemma 5.1, we obtain 


Ee A/ =Ee Afl > 


e A + e- A \ |e| . / A 2 \ 161 


a 1 + T 


Using (5.2.2), we conclude that 


Ee A/ > exp | 




which proves the lemma in the case when \H\ =0. Hence we may assume that 

\g\>\n\>o. 

Since the supports I e TL of monomials in h are pairwise disjoint, we have 


(5.4.1) 


Ee Xh = E 


, / e A i e -A\l^l 

^ - (HH 


Let us choose real p, q > 1, to be specified later, such that 

- + - = 1 . 

P d 

Applying the Holder inequality, we get 


Ee Xg/p = E (e xf/p e xh/p ) < (Ee xf ) /P (E 


A/U/P fqh/p 


and hence 


(Ee Xg /p) P 

Ee xf > -a- 

(E e Mh/py lq 
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Applying Lemma 5.1 and formula (5.4.1), we obtain 

' 6 Vp _|_ e ~ x /p \ ^ p / e M/p _|_ e -M/p \ 


Ee A/ > 


Since 


we obtain 


o p 3 ' -t- P X 'T'~ 

x */2 > +e > 1 + ^L for x>0, 
2 2 “ 


_ w / a 2 \ ls|p f A 2 ^n 

Ee/ ^ ,i+ vj exp r^r} 


Applying (5.2.2), we obtain 


E e xf > exp | 


^\G\_X 2 q\n\ A 4 |£| 

2 p 


Let us choose 


P = 


v'W\ 


2 p 8p 3 J 

■M 


VW\-eW\ 


and q = 


VW\ 


Then 




I A 2 / - -\ 2 / A 2 ( y/\g\ - y/\H\ t 

= exp Tb/iel-yiwj 1-- 


> exp 


2 
3 A 2 


4 VW\ 


VW\ - VW\ 


and the proof follows. □ 

(5.5) Lemma. Let f(x ) = g(x) — h(x) where 

g(x) = ^ x 7 , h(x) — ^ x 7 , Q n Li = 0 
leg ien 

and 

\s\ > \n 

Suppose that the constant terms of g and h are 0, that every variable Xi enters at 
most two monomials in h(x) and that if Xi enters exactly two monomials in h(x) 
then Xi enters at most two monomials in g(x). Then for 0 < A < 1, we have 

3A 2 


Ee^ > exp 
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Proof. We proceed by induction on the number k of variables Xi that enter exactly 
two monomials in h(x). If k = 0 then the result follows by Lemma 5.4. 

Suppose that k > 0 and that x % is a variable that enters exactly two monomials 
in h(x) and hence at most two monomials in g(x). As in the proof of Lemma 5.2, let 
fi : {0, l} n_1 —» R be the polynomial obtained from / by averaging with respect 
to Xi . As in the proof of Lemma 5.2, we have 

Ee A/ > Ee A/i where fcfx) = ^ x 1 - ^ x 1 

ieQi ieUi 

and Q % , respectively 'H l . is obtained from Q. respectively PL, by removing supports 
of monomials containing Xi . In particular, 

\Hi\ = \PL\ — 2 and \Qi\>\Q\-2. 

Applying the induction hypothesis to fi, we obtain 

Ee A/ >Ee A ^ > exp (y\g7\ - vlW)"} 

> ex p{^~ [vw\ - 2- \/w\ - 2 )} > exp {^“ (vw\ - vm )} 

and the proof follows. □ 

Finally, we are ready to prove Theorem 2.2. 

(5.6) Proof of Theorem 2.2. As in the proof of Theorem 2.3 of Section 5.3, 
without loss of generality we may assume that the maximum of / is attained at 

u = ( 1 , •• • , 1 ). 

We write 

f(x) — g(x ) — h(x) where g(x) — and h(x) — x 1 

leg ieu 

for some disjoint sets Q and PL of indices. Moreover, 

max f[x) = f(u) = \Q\ - \U\ = 5\T\. 
xe{-i,i} n 

Since 

\g\ + \H\ = \n 

we conclude that 

(5.6.1) \g\ = l±l\F\ and \H\ = ^\T\. 
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For i = 1.... . n let: /ij be the number of monomials in g that contain variable i 
and let be the number of monomials in h that contain x t . Then 

(5.6.2) [j,f + <4 for i — 

If for some i we have /if < then for the point u l obtained from u by switching 
the sign of the i-tli coordinate, we have 


/(«i) = del - 2 vt) - (m - 2ft) = \s\ - \n\ + 2 _ M +) > / (m) , 

contradicting that u is a maximum point of /. Therefore, 

ixf > n~ for i = l,...,n 
and, in view of (5.6.2), we conclude that 

/j~ < 2 for i = 1,... , n and if /i~ = 2 then fxj = 2. 


By Lemma 5.5, 

Ee A/ > exp | \f\G~\ — |. 

Using (5.6.1), we deduce that 



which completes the proof. 


□ 
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